Confidence measures in speech recognition based on probability distribution of likelihoods

نویسندگان

  • Joel Pinto
  • R. N. V. Sitaram
چکیده

In this paper, we propose two confidence measures (CMs) in speech recognition: one based on acoustic likelihood and the other based on phone duration. For a decoded speech frame aligned to an HMM state, the CM based on acoustic likelihood depends on the relative position of its output likelihood value in the probability distribution of likelihood value in that particular state. The CM of whole phone is the geometric mean of CMs of all frames in it. The CM based on duration depends on the deviation of the observed duration from the expected duration of the recognized phone. The two CMs are combined using weighted geometric mean to obtain a hybrid phone CM. The hybrid CM shows significant improvement over the CM based on time normalized log-likelihood score. On TI-digits database, at 20% false acceptance rate, the normalized acoustic log-likelihood based CM has a detection rate of 83.8% while the hybrid CM has a detection rate of 92.4%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Using Entropy Information to Improve Posterior Probability-Based Confidence Measures

In this paper, we propose a novel approach that reduces the confidence error rate of traditional posterior probability-based confidence measures in large vocabulary continuous speech recognition systems. The method enhances the discriminability of confidence measures by applying entropy information to the posterior probability-based confidence measures of word hypotheses. The experiments conduc...

متن کامل

A Word-spotting Hypothesis Testing for Accepting/Rejecting Continuous Speech Recognition Output

The word rejection problem in speech recognition is formulated in a framework of word-spotting, where a spotted word is verified through a binary, acceptance/rejection decision. A generalized word posterior probability (GWPP), used as the sole confidence measure, is computed in a word graph, via the forward-backward algorithm or in an N-best list, using string likelihoods. The GWPP is further e...

متن کامل

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

Knowing What You Don’t Know: Roles for Confidence Measures in Automatic Speech Recognition

The development of reliable measures of confidence for the decoding of speech sounds by machine has the potential to greatly enhance the ‘state-of-the-art’ in the field of automatic speech recognition (ASR). This dissertation describes the derivation of several complimentary confidence measures from a so-called acceptor hidden Markov model (HMM) based large vocabulary continuous speech recognit...

متن کامل

Confidence measures for hybrid HMM/ANN speech recognition

In this paper we introduce four acoustic confidence measures which are derived from the output of a hybrid HMM/ANN large vocabulary continuous speech recognition system. These confidence measures, based on local posterior probability estimates computed by an ANN, are evaluated at both phone and word levels, using the North American Business News corpus.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005